Population statistics cannot be used for reliable individual prediction
نویسنده
چکیده
It is known that predictions about individuals from statistical data about the population are in general unreliable. However, the size of the problem is not always realised. For a number of ways of predicting information about one variable from another with which it is correlated, we compute the reliability of such predictions. For the bivariate normal distribution, we demonstrate that unless the correlation is at least 0.99, not even the sign of a variable can be predicted with 95% reliability in an individual case. The other prediction methods we consider do no better. We do not expect our results to be substantially different for other distributions or statistical analyses. Correlations as high as 0.99 are almost unheard of in areas where correlations are routinely calculated. Where reliable prediction of one variable from another is required, measurement of correlations is irrelevant, except to show when it cannot be done.
منابع مشابه
Optimal Non-Parametric Prediction Intervals for Order Statistics with Random Sample Size
In many experiments, such as biology and quality control problems, sample size cannot always be considered as a constant value. Therefore, the problem of predicting future data when the sample size is an integer-valued random variable can be an important issue. This paper describes the prediction problem of future order statistics based on upper and lower records. Two different cases for the ...
متن کاملApplication of Gene Expression Programming to water dissolved oxygen concentration prediction
This research based on record and collected data from four stations at Eymir Lake, Turkey, which are monitored daily in seven months. Water quality monitoring using former methods are time-needed and expensive, while the application of gene expression programming is more understandable, rapid, and reliable which is used in this article to provide a prediction for dissolved oxygen. The concentra...
متن کاملپایایی و روایی آزمون ترسیم ساعت در سالمندان
Objectives: Early diagnosis of cognitive disorders in order to initiate new efficient treatments in time is an important task which cannot be fulfilled without proper cognitive screening tools. The Clock Drawing Test (CDT) is a simple inexpensive cognitive screening tool which can be used in primary care settings delivering health services to older people. The aim of this study was to assess va...
متن کاملHybrid Method of Logistic Regression and Data Envelopment Analysis for Event Prediction: A Case Study (Stroke Disease)
Abstract Predictive analytics is an area of statistics that deals with extracting information from data and using it to predict trends and behavior patterns. Many mathematical modeling has been developed and used for prediction, and in some cases, they have been found to be very strong and reliable. This paper studies different mathematical and statistical approaches for events prediction. The ...
متن کاملPerformance Prediction of a Flexible Manufacturing System
The present investigation presents a stochastic model for a flexible manufacturing system consisting of flexible machine, loading/unloading robot and an automated pallethandling device. We consider unreliable flexible manufacturing cell (FMC) wherein machine and robot operate under individual as well as common cause random failures. The pallethandling system is completely reliable. The pallet o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007